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SEQUENCE LISTING 

<110> Rajgarhia, Vineet 
Penttila, Merja 
Ruohonen , Laura 
Ilmen, Marja 
Koivuranta, Kari 

<120> Methods and materials for synthesis of organic products 
<13 0> MBHB00-1237-A 

<140> 09/992,430 
<141> 2001-11-23 

<150> 60/252541 
<151> 2000-11-22 

<160> 65 

<170> Patentln version 3.1 

<210> 1 
<211> 92 
<212> DNA 

<213>.- Artificial sequence 
<220> 

<223> Multiple cloning site 



COPY OF PAPERS 
ORIGINALLY FILED 



cccaagcttg aattccccgg gggatccctg cagggtacca cgcgtagatc tactagtgcg 



gccgcctcga gtctagaggg cccaagcttg gg 



60 
92 



<210> 2 

<211> 91 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Multiple cloning site 



<400> 2 

ttgg gccctctaga ctcgaggcgg ccgcactagt agatctacgc gtggtaccct 



ccaagc 



gcagggatcc cccggggaat tcaagcttgg g 



60 
91 



<210> 3 

<211> 31 

<212> DNA 

<213> Lactobacillus helveticus 



<400> 3 

ccgggatcca tggcaagaga ggaaaaacct c 



31 



1 




<210> 4 
<211> 32 
<212> DNA 

<213> Lactobacillus helveticus 
<400> 4 

ccaagatctt tattgacgaa ccttaacgcc ag 32 



<210> 5 
<211> 37 
<212> DNA 

<213> Pediococcus acidilact ici 
<400> 5 

ccgggatcca tgtctaatat tcaaaatcat caaaaag 37 



<210> 6 
<211> 33 
<212> DNA 

<213> Pediococcus acidilactici 
<400> 6 

ccaagatctt tatttgtctt gtttttcagc aag 33 



<210> 7 
<211> 82 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 7 

taaacagtac aatcgcaaag aaaagctcca cacccaaacc aaataattgc aatgcaactt 60 
cttttctttt tttttctttt ct 82 



<210> 8 
<211> 79 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 8 

ttataaaatc attaaaatcc aaaatcgtaa tttatctctt tatcctctcc ctctctacat 60 
gccggtagag gtgtggtca 7 9 



<210> 9 

<211> 1736 

<212> DNA 

<213> Unknown 

<220> 
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<223> Kanamycin resistance gene 
<400> 9 

gtacaacttg agcaagttgt cgatcagctc ctcaaattgg tcctctgtaa cggatgactc 60 

aacttgcaca ttaacttgaa gctcagtcga ttgagtgaac ttgatcaggt tgtgcagctg 120 

gtcagcagca tagggaaaca cggcttttcc taccaaactc aaggaattat caaactctgc 180 

aacacttgcg tatgcaggta gcaagggaaa tgtcatactt gaagtcggac agtgagtgta 240 

gtcttgagaa attctgaagc cgtattttta ttatcagtga gtcagtcatc aggagatcct 300 

ctacgccgga cgcatcgtgg ccgacctgca gggggggggg gggcgctgag gtctgcctcg 360 

tgaagaaggt gttgctgact cataccaggc ctgaatcgcc ccatcatcca gccagaaagt 420 

gagggagcca cggttgatga gagctttgtt gtaggtggac cagttggtga ttttgaactt 480 

ttgctttgcc acggaacggt ctgcgttgtc gggaagatgc gtgatctgat ccttcaactc 540 

agcaaaagtt cgatttattc aacaaagccg ccgtcccgtc aagtcagcgt aatgctctgc 600 

cagtgttaca accaattaac caattctgat tagaaaaact catcgagcat caaatgaaac 660 

tgcaatttat tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat 720 

gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg 780 

attccgactc gtccaacatc aatacaacct ttaatttccc ctcgtcaaaa ataaggttat 840 

caagtgagaa atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca 900 

ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 960 

aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 1020 

aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 1080 

aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 1140 

gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 1200 

aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 1260 

aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 1320 

atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 1380 

agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 1440 

cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 1500 

atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 1560 

ccccctgcag gtcggcatca ccggcgccac aggtgcggtt gctggcgcct atatcgccga 1620 
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catcaccgat ggggaagatc gggctcgcca cttcgggctc atgagcgctt gtttcggcgt 1680 

gggtatggtg gcaggccccg tggccggggg actgttgggc gccatctcct tgcatg 1736 

<210> 10 
<211> 372 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 10 

ccggttcttt ctcttactct tacaagacca agaacattgt cgaattccac tccgactaca 60 

tcaaggtcag aaacgccact ttcccaggtg tccaaatgaa gttcgtcttg caaaagttgt 120 

tgaccaaggt caaggatgct gctaagggtt acaagccagt tccagttcct cacgctccaa 180 

gagacaacaa gccagttgct gactctactc cattgaagca agaatgggtc tggactcaag 240 

tcggtaagtt cctacaagaa ggtgatgttg ttctaactga aaccggtacc tccgctttcg 300 

gtatcaacca aacccacttc ccaaatgaca cctacggtat ctcccaagtc ttgtggggtt 360 

ccattggttt ca 372 

<210> 11 
<211> 747 
<212> DNA 

<213> Kluyveromyces marxianus 
<400> 11 

ttaccactgt cttcggtctg ccaggtgact tcaatctgcg tctgttggac gagatctacg 60 

aggtcgaggg tatgagatgg gccggtaact gtaacgagtt gaacgcttct tacgctgccg 120 

acgcttacgc cagaatcaag ggtatgtcct gtttgatcac caccttcggt gtcggtgagt 180 

tgtccgcttt gaacggtatc gccggttctt acgctgagca cgtcggtgtc ttgcacattg 240 

tcggtgtccc atccgtctcc gcccaggcca agcagctatt gttgcaccac accttgggta 300 

acggtgactt cactgtcttc cacagaatgt ccgccaacat ctctgagacc actgctatga 360 

tcactgatct agctaccgcc ccatctgaga tcgacagatg tatcagaacc acctacatta 420 

gacagagacc tgtctacttg ggtttgccat ctaacttcgt tgaccagatg gtcccagcct 480 

ctctattgga caccccaatt gacttggcct tgaagccaaa cgaccagcag gctgaggagg 540 

aggtcatctc tactttgttg gagatgatca aggacgctaa gaacccagtc atcttggctg 600 

acgcttgcgc ttccagacac gatgtcaagg ctgagaccaa gaagttgatt gacatcactc 660 

agttcccatc tttcgttacc ccaatgggta agggttccat tgacgagaag cacccaagat 720 

tcggtggtgt ctacgtcggt accttgt 747 
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<210> 12 
<211> 1738 
<212> DNA 
<213> Unknown 

<220> 

<223> kanamycin resistance gene fragment 
<400> 12 

gtacaacttg agcaagttgt cgatcagctc ctcaaattgg tcctctgtaa cggatgactc 60 

aacttgcaca ttaacttgaa gctcagtcga ttgagtgaac ttgatcaggt tgtgcagctg 120 

gtcagcagca tagggaaaca cggcttttcc taccaaactc aaggaattat caaactctgc 180 

aacacttgcg tatgcaggta gcaagggaaa tgtcatactt gaagtcggac agtgagtgta 240 

gtcttgagaa attctgaagc cgtattttta ttatcagtga gtcagtcatc aggagatcct 300 

ctacgccgga cgcatcgtgg ccgacctgca gggggggggg gggcgctgag gtctgcctcg 3 60 

tgaagaaggt gttgctgact cataccaggc ctgaatcgcc ccatcatcca gccagaaagt 420 

gagggagcca cggttgatga gagctttgtt gtaggtggac cagttggtga ttttgaactt 480 

ttgctttgcc acggaacggt ctgcgttgtc gggaagatgc gtgatctgat ccttcaactc 540 

agcaaaagtt cgatttattc aacaaagccg ccgtcccgtc aagtcagcgt aatgctctgc 600 

cagtgttaca accaattaac caattctgat tagaaaaact catcgagcat caaatgaaac 660 

tgcaatttat tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat 720 

gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg 7 80 

attccgactc gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta 840 

tcaagtgaga aatcaccatg agtgacgact gaatccggtg agaatggcaa aagcttatgc 900 

atttctttcc agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca 960 

tcaaccaaac cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg 102 0 

ttaaaaggac aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca 108 0 

tcaacaatat tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg 1140 

gggatcgcag tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc 1200 

ggaagaggca taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg 1260 

gcaacgctac ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat 1320 

cgatagattg tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa 1380 
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tcagcatcca tgttggaatt taatcgcggc ctcgagcaag acgtttcccg ttgaatatgg 1440 

ctcataacac cccttgtatt actgtttatg taagcagaca gttttattgt tcatgatgat 1500 

atatttttat cttgtgcaat gtaacatcag agattttgag acacaacgtg gctttccccc 1560 

ccccccctgc aggtcggcat caccggcgcc acaggtgcgg ttgctggcgc ctatatcgcc 1620 

gacatcaccg atggggaaga tcgggctcgc cacttcgggc tcatgagcgc ttgtttcggc 1680 

gtgggtatgg tggcaggccc cgtggccggg ggactgttgg gcgccatctc cttgcatg 1738 



<210> 13 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> degenerative amplification primers 

<400> 13 

gtbatyggyt chggtac 17 



<210> 14 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> degenerative amplification primers 

<400> 14 

swrtcdccrt gytcacc 17 



<210> 15 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> amplification primers 

<400> 15 

gtacagttct ggatactgct eg 22 



<210> 16 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> amplification primers 
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<400> 16 

acaggcatcg atgctgtc 
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<210> 17 
<211> 21 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 17 

ctacttggag ccactatcga c 21 



<210> 18 
<211> 19 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 18 

gtgatgtcgg cgatatagg 

<210> 19 
<211> 21 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 19 

gatctcctgc taagctcttg c 

<210> 20 
<211> 20 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 20 

gcagttttgg atattcatgc 



<210> 21 

<211> 972 

<212> DNA 

<213> Kluyveromyces thermotolerans 
<220> 

<221> CDS 

<222> (1) . . (972) 

<223> 



<400> 21 

atg ttc caa gat aca aag tct caa gca gta aga act gat gcc aaa aca 
Met Phe Gin Asp Thr Lys Ser Gin Ala Val Arg Thr Asp Ala Lys Thr 
! 5 10 15 

gta aaa gtt gtg gta gtg gga gtg gga agt gtt ggg tct gcc aca gcg 
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Val Lys Val Val Val Val Gly Val Gly Ser Val Gly Ser Ala Thr Ala 

20 25 30 

tat acg ttg ctt etc age ggc ate gtt tec gag att gtc ctt ate gac 
Tyr Thr Leu Leu Leu Ser Gly He Val Ser Glu He Val Leu He Asp 
35 40 45 

gtg aac aaa gac aaa gca gag ggt gaa age atg gac tta aac cac gca 
Val Asn Lys Asp Lys Ala Glu Gly Glu Ser Met Asp Leu Asn His Ala 
50 55 60 

gca cct tea aat aca agg tct cga gcg ggt gat tat cct gac tgc get 
Ala Pro Ser Asn Thr Arg Ser Arg Ala Gly Asp Tyr Pro Asp Cys Ala 
65 70 75 80 

ggc gcg gee att gtt att gtc aca tgt ggg att aac caa aaa aat gga 
Gly Ala Ala He Val He Val Thr Cys Gly He Asn Gin Lys Asn Gly 

85 90 95 

caa aca agg atg gat ctt get gca aaa aat gec aac att atg ctg gaa 
Gin Thr Arg Met Asp Leu Ala Ala Lys Asn Ala Asn He Met Leu Glu 

100 105 HO 

ate atc.ccc aat gtt gec aaa tat get cct gat acc ate ctg ctt att 
He He Pro Asn Val Ala Lys Tyr Ala Pro Asp Thr lie Leu Leu lie 

115 120 . 125 • • ~ ; : ;;- i .:;:v : :j;; 

gee acg aat cct gtc gat gtt ttg acc tat att age tat aag gcg tea 
Ala Thr Asn Pro Val Asp Val Leu Thr Tyr He Ser Tyr Lys Ala Ser - 
130 135 140 

ggg ttt cca eta age aga gtt ate ggc tea ggt aca gtt ctg gat act 
Gly Phe Pro Leu Ser Arg Val He Gly Ser Gly Thr Val Leu Asp Thr 
145 150 155 160 

get cgt ttt aaa tac ate etc gga gag cac ttc aag ate tea teg gac 
Ala Arg Phe Lys Tyr He Leu Gly Glu His Phe Lys He Ser Ser Asp 

165 170 175 

age ate gat gec tgt gta att gga gaa cat ggt gat teg ggt gtg cct 
Ser He Asp Ala Cys Val He Gly Glu His Gly Asp Ser Gly Val Pro 

180 185 190 

gtc tgg tct ctt acc aac ate gac ggc atg aag etc egg gat tac tgc 
Val Trp Ser Leu Thr Asn He Asp Gly Met Lys Leu Arg Asp Tyr Cys 
195 200 205 

gaa aaa gee aac cac ata ttt gat cag aat gcg ttc cat aga ate ttt 
Glu Lys Ala Asn His He Phe Asp Gin Asn Ala Phe His Arg He Phe 
210 215 220 

gag caa acg cga gac get get tac gat ate ate aag cgc aaa ggc tat 
Glu Gin Thr Arg Asp Ala Ala Tyr Asp He He Lys Arg Lys Gly Tyr 
225 230 235 240 

act tea tat gga ate gca gcg gga tta ctt cgc ata gta aag gcg att 
Thr Ser Tyr Gly He Ala Ala Gly Leu Leu Arg He Val Lys Ala He 



144 



192 



240 



288 



336 



384 



;432 



480 



528 



576 



624 



672 



720 



768 
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245 250 255 

tta gag gat aca gga tec aca ctt aca gtt tea ace gtt ggt gat tat 816 
Leu Glu Asp Thr Gly Ser Thr Leu Thr Val Ser Thr Val Gly Asp Tyr 

260 265 270 

ttt ggg gtt gaa caa att get ata age gtc cct ace aaa etc aat aaa 864 
Phe Gly Val Glu Gin lie Ala lie Ser Val Pro Thr Lys Leu Asn Lys 
275 280 285 

agt ggg get cat caa gtg get gaa ctt tea etc gat gag aag gaa ata 912 
Ser Gly Ala His Gin Val Ala Glu Leu Ser Leu Asp Glu Lys Glu lie 
290 295 300 

gaa ttg atg gaa aaa tea get agt cag ate aaa tea gtg att gag cat 960 
Glu Leu Met Glu Lys Ser Ala Ser Gin lie Lys Ser Val He Glu His 
305 310 315 320 

ctg gag ate aat 972 
Leu Glu He Asn 



<210> 22 

<211> 324 

<212> PRT ; 

<213> Kluyveromyces thermotolerans 

<400> 22 

Met Phe Gin Asp Thr Lys Ser Gin Ala Val Arg Thr Asp Ala Lys Thr 
1 5 10 15 



Val Lys Val Val Val Val Gly Val Gly Ser Val Gly Ser Ala Thr Ala 

20 25 30 



Tyr Thr Leu Leu Leu Ser Gly He Val Ser Glu He Val Leu He Asp 
35 40 45 



Val Asn Lys Asp Lys Ala Glu Gly Glu Ser Met Asp Leu Asn His Ala 
50 55 60 



Ala Pro Ser Asn Thr, Arg Ser Arg Ala Gly Asp Tyr Pro Asp Cys Ala 
65 70 75 80 



Gly Ala Ala He Val He Val Thr Cys Gly He Asn Gin Lys Asn Gly 

85 90 95 



Gin Thr Arg Met Asp Leu Ala Ala Lys Asn Ala Asn He Met Leu Glu 

100 105 110 
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lie lie Pro Asn Val Ala Lys Tyr Ala Pro Asp Thr lie Leu Leu lie 
115 120 125 



Ala Thr Asn Pro Val Asp Val Leu Thr Tyr lie Ser Tyr Lys Ala Ser 
130 135 140 



Gly Phe Pro Leu Ser Arg Val He Gly Ser Gly. Thr Val Leu Asp Thr 

145 150 155 160 

Ala Arg Phe Lys Tyr He Leu Gly Glu His Phe Lys He Ser Ser Asp 

165 170 175 



Ser He Asp Ala Cys Val He Gly Glu His Gly Asp Ser Gly Val Pro 

180 185 190 



Val Trp Ser Leu Thr Asn He Asp Gly Met Lys Leu Arg Asp Tyr Cys 
195 200 205 

Glu Lys Ala Asn His He Phe Asp Gin Asn Ala Phe His Arg lie Phe 
210 215 220 

Glu Gin Thr Arg Asp Ala Ala Tyr Asp He He Lys Arg Lys Gly Tyr 
225 230 235 240 

Thr Ser Tyr Gly He Ala Ala Gly Leu Leu Arg He Val Lys Ala He 

245 250 255 

Leu Glu Asp Thr Gly Ser Thr Leu Thr Val Ser Thr Val Gly Asp Tyr 

260 265 270 

Phe Gly Val Glu Gin He Ala He Ser Val Pro Thr Lys Leu Asn Lys 
275 280 285 

Ser Gly Ala His Gin Val Ala Glu Leu Ser Leu Asp Glu Lys Glu He 
290 295 300 

Glu Leu Met Glu Lys Ser Ala Ser Gin He Lys Ser Val He Glu His 
305 310 315 320 



Leu Glu He Asn 
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<210> 23 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> degenerate amplification primers 
<400> 23 

gtyggtgchg gtgchgthgg 



<210> 24 

<211> 17 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> degenerate amplification primers 
<400> 24 

swrtcdccrt gytcbcc 



<210> 25 

<211> 27 

<212> DNA 

<213> Artificial 



sequence 



<220> 

<223> amplification primers 
<400> 25 

atccacaaca gcttacacgt tattgag 



<210> 


26 


<211> 


28 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


amplification primers 


<400> 


26 



gtttggttgc tggaagtggt gttgatag 



<210> 27 

<211> 27 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> amplification primers 
<400> 27 
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aacattgaat agcttgctca ggttgtg 27 



<210> 28 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> amplification primers 

<400> 28 

gataataaac gcgttgacat ttcagatg 28 



<210> 29 

<211> 939 

<212> DNA 

<213> Torulaspora pretoriensis 



<220> 

<221> CDS 

<222> (1) . . (939) 

<223> 



<400> 29 

atg cat aga tgt get aaa gtg gec ate gtc ggt gec ggc caa gtt gga 48 
Met His Arg Cys Ala Lys Val Ala lie Val Gly Ala Gly Gin Val Gly 
15 10 15 

tec aca aca get tac acg tta tta ttg agt agt ttg gtt get gaa gtg 96 
Ser Thr Thr Ala Tyr Thr Leu Leu Leu Ser Ser Leu Val Ala Glu Val 

20 25 30 

gtg ttg ata gat gtc gat aaa aga aag gtc gaa ggc caa ttt atg gat 144 
Val Leu lie Asp Val Asp Lys Arg Lys Val Glu Gly Gin Phe Met Asp 
35 40 45 

ctg aac cac gcg get cct tta acg aag gag tea cga ttc agt get ggg 192 
Leu Asn His Ala Ala Pro Leu Thr Lys Glu Ser Arg Phe Ser Ala Gly 
50 55 60 

gac tat gaa agt tgt get gat get gcg gtt gta ate gta acg ggc ggg 240 
Asp Tyr Glu Ser Cys Ala Asp Ala Ala Val Val lie Val Thr Gly Gly 
65 70 75 80 

get aat cag aaa cct ggt caa act aga atg gag eta gec gag agg aac 288 
Ala Asn Gin Lys Pro Gly Gin Thr Arg Met Glu Leu Ala Glu Arg Asn 

85 90 95 

gtt aaa ate atg cag gaa gtg ate cct aag att gtg aaa tac gec ccc 336 
Val Lys lie Met Gin Glu Val He Pro Lys He Val Lys Tyr Ala Pro 

100 105 110 

aac gca att ttg ctg att gca aca aac cct gtc gat gta ctt acc tat 384 
Asn Ala He Leu Leu He Ala Thr Asn Pro Val Asp Val Leu Thr Tyr 
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115 120 125 

get agt ttg aaa.gcg teg gga ttc cca gca age egg gtt att ggt tct 432 
Ala Ser Leu Lys Ala Ser Gly Phe Pro Ala Ser Arg Val lie Gly Ser 
130 135 140 

ggg aca gtt etc gac tct get cgt ata cag cac aac ctg age aag eta 480 
Gly Thr Val Leu Asp Ser Ala Arg lie Gin His Asn Leu Ser Lys Leu 
145 150 155 160 

ttc aat gtt tea tct gaa agt gtc aac gcg ttt att ate ggg gaa cat 528 
Phe Asn Val Ser Ser Glu Ser Val Asn Ala Phe lie lie Gly Glu His 

165 170 175 

ggt gac tea agt gtg ccc gtc tgg teg ctt get gag att gee ggc atg 576 
Gly Asp Ser Ser Val Pro Val Trp Ser Leu Ala Glu He Ala Gly Met 

180 185 190 

aaa gtg gag gat tac tgt agg cag tec aag aga aag ttt gac ccc age 624 
Lys Val Glu Asp Tyr Cys Arg Gin Ser Lys Arg Lys Phe Asp Pro Ser 
195 200 205 

att ctg acc aaa ata tat . gag gag teg cgt gac gcg gca gee tac ate 672 
He Leu Thr Lys He Tyr, Glu Glu Ser Arg Asp Ala Ala Ala Tyr He 
210 215 220 

ata gaa cgc aaa ggc tat acc aat ttc ggg att gca gca ggt ttg get 720 
He Glu Arg Lys Gly Tyr Thr Asn Phe Gly He Ala Ala Gly Leu Ala 
225 230 235 240 

agg ata gtg aga get att ctg aga gat gaa ggt gee eta tta act gtg 768 
Arg He Val Arg Ala lie Leu Arg Asp Glu Gly Ala Leu Leu Thr Val 

245 250 255 

tct act gta ggt gag cac ttt ggc atg aaa gat gtt tea ttg agt gtt 816 
Ser Thr Val Gly Glu His Phe Gly Met Lys Asp Val Ser Leu Ser Val 

260 265 270 

cca act agg gta gac agg age ggc get cac cat gtc gtc gac ctt ctg 864 
Pro Thr Arg Val Asp Arg Ser Gly Ala His His Val Val Asp Leu Leu 
275 280 285 

eta aac gac aag gag ctg gag caa att aaa aca tct gga gec aag ata 912 
Leu Asn Asp Lys Glu Leu Glu Gin He Lys Thr Ser Gly Ala Lys He 
290 295 300 

aag tea gee tgt gat gaa ctt ggc att 939 
Lys Ser Ala Cys Asp Glu Leu Gly He 
305 310 



<210> 30 

<211> 313 

<212> PRT 

<213> Torulaspora pretoriensis 

<400> 30 
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Met His Arg Cys Ala Lys Val Ala lie Val Gly Ala Gly Gin Val Gly 
1 5 10 15 



Ser Thr Thr Ala Tyr Thr Leu Leu Leu Ser Ser Leu Val Ala Glu Val 

20 25 30 



Val Leu lie Asp Val Asp Lys Arg Lys Val Glu Gly Gin Phe Met Asp 
35 40 45 



Leu Asn His Ala Ala Pro Leu Thr Lys Glu Ser Arg Phe Ser Ala Gly 
50 55 60 



Asp Tyr Glu Ser Cys Ala Asp Ala Ala Val Val lie Val Thr Gly Gly 
65 70 75 80 



Ala Asn Gin Lys Pro Gly Gin Thr Arg Met Glu Leu Ala Glu Arg Asn 

85 90 95 



Val Lys lie Met Gin Glu Val lie Pro Lys lie Val Lys Tyr Ala Pro 

100 105 110 



Asn Ala lie Leu Leu lie Ala Thr Asn Pro Val Asp Val Leu Thr Tyr 
115 120 125 



Ala Ser Leu Lys Ala Ser Gly Phe Pro Ala Ser Arg Val lie Gly Ser 
130 135 140 



Gly Thr Val Leu Asp Ser Ala Arg lie Gin His Asn Leu Ser Lys Leu 
145 150 155 160 



Phe Asn Val Ser Ser Glu Ser Val Asn Ala Phe lie lie Gly Glu His 

165 170 175 



Gly Asp Ser Ser Val Pro Val Trp Ser Leu Ala Glu lie Ala Gly Met 

180 185 190 



Lys Val Glu Asp Tyr Cys Arg Gin Ser Lys Arg Lys Phe Asp Pro Ser 
195 200 205 



lie Leu Thr Lys lie Tyr Glu Glu Ser Arg Asp Ala Ala Ala Tyr lie 
210 215 220 
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He Glu Arg Lys Gly Tyr Thr Asn Phe Gly He Ala Ala Gly Leu Ala 
225 230 235 240 



Arg He Val Arg Ala He Leu Arg Asp Glu Gly Ala Leu Leu Thr Val 

245 250 255 



Ser Thr Val Gly Glu His Phe Gly Met Lys Asp Val Ser Leu Ser Val 

260 265 270 



Pro Thr Arg Val Asp Arg Ser Gly Ala His His Val Val Asp Leu Leu 
275 280 285 



Leu Asn Asp Lys Glu Leu Glu Gin He Lys Thr Ser Gly Ala Lys He 
290 295 300 



Lys Ser Ala Cys Asp Glu Leu Gly He 
305 310 



<210> 31 

<211> 21 r ."• . . 

<212> DNA ;.: ' r 

<213> Bacillus megaterium 

<400> 31 

cctgagtcca cgtcattatt c 21 



<210> 32 
<211> 22 
<212> DNA 

<213> Bacillus megaterium 
<400> 32 

tgaagctatt tattcttgtt ac 22 



<210> 33 
<211> 27 
<212> DNA 

<213> Bacillus megaterium 
<400> 33 

gctctagatg aaaacacaat ttacacc 27 



<210> 34 

<211> 28 

<212> DNA 

<213> Bacillus megaterium 

<400> 34 
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atggatcctt acacaaaagc tctgtcgc 



28 



<210> 35 

<211> 26 

<212> DNA 

<213> Rhizopus oryzae 



<400> 35 

ctttattttt ctttacaata taattc 



26 



<210> 36 

<211> 19 

<212> DNA 

<213> Rhizopus oryzae 



<400> 36 

actagcagtg caaaacatg 



19 



<210> 37 
<211> 29 
<212> DNA 

<213> Rhizopus oryzae 
<400> 37 

gctctagatg gtattacact caaaggtcg 



29 



<210> 38 

<211> 30 

<212> DNA 

<213> Rhizopus oryzae 



<400> 38 

gctctagatc aacagctact tttagaaaag 



30 



<210> 39 

<211> 28 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



cloning site sequence 



<400> 39 

aaatctagat gagccatatt caacggga 



28 



<210> 40 

<211> 29 

<212> DNA 

<213> Artificial sequence 



<220> 
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<223> cloning site sequence 
<400> 40 

ccggatcctt agaaaaactc atcgagcat 2 9 



<210> 41 
<211> 36 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 41 

gctctagaat tatgttccaa gatacaaagt ctcaag 3 6 



<210> 42 
<211> 34 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 42 

ccggaattca tcctcaattg atctccagat gctc 34 



<210> 43 
<211> 2229 
<212> DNA 

<213> Kluyveromyces thermotolerans 
<400> 43 

gcggccgcgg atcgctcttc cgctatcgat taattttttt ttctttcctc tttttattaa 60 

ccttaatttt tattttagat tcctgacctt caactcaaga cgcacagata ttataacatc 120 

tgcacaatag gcatttgcaa gaattactcg tgagtaagga aagagtgagg aactatcgca 180 

tacctgcatt taaagatgcc gatttgggcg cgaatccttt attttggctt caccctcata 240 

ctattatcag ggccagaaaa aggaagtgtt tccctccttc ttgaattgat gttaccctca 300 

taaagcacgt ggcctcttat cgagaaagaa attaccgtcg ctcgtgattt gtttgcaaaa 360 

agaacaaaac tgaaaaaacc cagacacgct cgacttcctg tcttcctatt gattgcagct 420 

tccaatttcg tcacacaaca aggtcctagc gacggctcac aggttttgta acaagcaatc 480 

gaaggttctg gaatggcggg aaagggttta gtaccacatg ctatgatgcc cactgtgatc 540 

tccagagcaa agttcgttcg atcgtactgt tactctctct ctttcaaaca gaattgtccg 600 

aatcgtgtga caacaacagc ctgttctcac acactctttt cttctaacca agggggtggt 660 

ttagtttagt agaacctcgt gaaacttaca tttacatata tataaacttg cataaattgg 720 

tcaatgcaag aaatacatat ttggtctttt ctaattcgta gtttttcaag ttcttagatg 780 

ctttcttttt ctctttttta cagatcatca aggaagtaat tatctacttt ttacaacaaa 840 
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tctagaatta tgttccaaga tacaaagtct caagcagtaa gaactgatgc caaaacagta 900 

aaagttgtgg tagtgggagt gggaagtgtt gggtctgcca cagcgtatac gttgcttctc 960 

agcggcatcg tttccgagat tgtccttatc gacgtgaaca aagacaaagc agagggtgaa 1020 

agcatggact taaaccacgc agcaccttca aatacaaggt ctcgagcggg tgattatcct 1080 

gactgcgctg gcgcggccat tgttattgtc acatgtggga ttaaccaaaa aaatggacaa 1140 

acaaggatgg atcttgctgc aaaaaatgcc aacattatgc tggaaatcat ccccaatgtt 1200 

gccaaatatg ctcctgatac catcctgctt attgccacga atcctgtcga tgttttgacc 1260 

tatattagct ataaggcgtc agggtttcca ctaagcagag ttatcggctc aggtacagtt 1320 

ctggatactg ctcgttttaa atacatcctc ggagagcact tcaagatctc atcggacagc 1380 

atcgatgcct gtgtaattgg agaacatggt gattcgggtg tgcctgtctg gtctcttacc 1440 

aacatcgacg gcatgaagct ccgggattac tgcgaaaaag ccaaccacat atttgatcag 1500 

aatgcgttcc atagaatctt tgagcaaacg cgagacgctg cttacgatat catcaagcgc ^560 

aaaggctata cttcatatgg aatcgcagcg ggattacttc gcatagtaaa ggcgatttta 1620 

gaggatacag gatccacact tacagtttca accgttggtg attattttgg ggttgaacaa 1680 

attgctataa gcgtccctac caaactcaat aaaagtgggg ctcatcaagt ggctgaactt 1740 

tcactcgatg agaaggaaat agaattgatg gaaaaatcag ctagtcagat caaatcagtg 1800 

attgagcatc tggagatcaa ttgaggatga attcggatcc ggtagataca ttgatgctat 1860 

caatccagag aactggaaag attgtgtagc cttgaaaaac ggtgaaactt acgggtccaa 1920 

gattgtctac agattttcct gatttgccag cttactatcc ttcttgaaaa tatgcactct 1980 

atatctttta gttcttaatt gcaacacata gatttgctgt ataacgaatt ttatgctatt 2040 

ttttaaattt ggagttcagt gataaaagtg tcacagcgaa tttcctcaca tgtagggacc 2100 

gaattgttta caagttctct gtaccaccat ggagacatca aaaattgaaa atctatggaa 2160 

agatatggac ggtagcaaca agaatatagc acgagccgcg gatttatttc gttacgcatg 2220 

cgcggccgc 2229 



<210> 44 
<211> 32 
<212> DNA 

<213> Candida sonorensis 
<400> 44 

tggactagta aaccaacagg gattgcctta gt 3 2 
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<210> 45 

<211> 33 

<212> DNA 

<213> Candida sonorensis 



<400> 45 

ctagtctaga gatcattacg ccagcatcct agg 



33 



<210> 46 

<211> 37 

<212> DNA 

<213> Candida albicans 



<400> 46 

gcgatctcga ggtcctagaa tatgtatact aatttgc 



37 



<210> 47 

<211> 36 

<212> DNA 

<213> Candida albicans 



<400> 47 

acttggccat ggtgatagtt attcttctgc aattga 



36 



<210> 48 

<211> 20 

<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 48 

tgtcatcact gctccatctt 



20 



<210> 49 

<211> 20 

<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 49 

ttaagccttg gcaacatatt 



20 



<210> 50 

<211> 37 

<212> DNA 

<213> Candida albicans 



<400> 50 

gcgatctcga ggtcctagaa tatgtatact aatttgc 



37 



<210> 51 
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<211> 39 
<212> DNA 

<213> Candida albicans 
<400> 51 

cgcgaattcc catggttagt ttttgttgga aagagcaac 



<210> 52 
<211> 32 
<212> DNA 

<213> Candida sonorensis 
<400> 52 

tggactagta aaccaacagg gattgcctta gt 



<210> 53 
<211> 33 
<212> DNA 

<213> Candida sonorensis 
<400> 53 

ctagtctaga gatcattacg ccagcatcct agg 



<210> 54 
<211> 44 
<212> DNA 

<213> Candida sonorensis 
<400> 54 

ccggaattcg atatctgggc wggkaatgcc aaygarttra < 



<210> 55 

<211> 44 

<212> DNA 

<213> Candida sonorensis 
<220> 

<223> primer that does not encode amino aci 
<220> 

<221> misc_f eature 

<222> (21) . . (21) 

<223> n stands for any nucleotide 



<220> 

<221> misc_feature 

<222> (33) . . (33) 

<223> n stands for any nucleotide 



<400> 55 

cgcggattca ggcctcagta ngaraawgaa ccngtrttra 



20 
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<210> 56 

<211> 10 

<212> PRT 

<213> Candida sonorensis 

<400> 56 

Trp Ala Gly Asn Ala Asn Glu Leu Asn Ala 
i 5 10 



<210> 57 
<211> 10 
<212> PRT 

<213> Candida sonorensis 
<400> 57 

Asp Phe Asn Thr Gly Ser Phe Ser Tyr Ser 



<210> 58 
<211> 18 
<212> DNA 

<213> Candida sonorensis 

<400> 58 !8 
tctgttmcct acrtaaga 



<210> 59 
<211> 20 
<212> DNA 

<213> Candida sonorensis 

<400> 59 20 
gtyggtggtc acgaaggtgc 



<210> 60 
<211> 36 
<212> DNA 

<213> Candida sonorensis 

<400> 60 36 
gcgatctcga gaaagaaacg acccatccaa gtgatg 



<210> 61 

<211> 68 

<212> DNA 

<213> Candida sonorensis 

<400> 61 
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* 



tggactagta catgcatgcg gtgagaaagt agaaagcaaa cattgtatat agtcttttct 60 
attattag 68 



<210> 62 
<211> 34 
<212> DNA 

<213> Candida sonorensis 
<400> 62 

gcgatctcga gaaaatgtta ttataacact acac 34 



<210> 63 
<211> 75 
<212> DNA 

<213> Candida sonorensis 
<400> 63 

tggactagta catgcatgcg gtgagaaagt agaaagcaaa cattttgttt gatttgtttg 60 
ttttgttttt gtttg 75 



<210> 64 
<211> 36 
<212> DNA 

<213> Candida sonorensis 
<400> 64 

gcgatctcga gaaagaaacg acccatccaa gtgatg 3 6 



<210> 65 
<211> 35 
<212> DNA 

<213> Candida sonorensis 
<400> 65 

acttggccat ggtatatagt cttttctatt attag 35 
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